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Abstract: State estimation is necessary in diagnosing anomalies in Water Demand 
Systems (WDS). In this paper we present a neural network performing such a task. 
State estimation is performed by using optimization, which tries to reconcile all the 
available information. Quantification of the uncertainty of the input data (telemetry 
measures and demand predictions) can be achieved by means of robust estate estima- 
tion. Using a mathematical model of the network, fuzzy estimated states for anomalous 
states of the network can be obtained. They are used to train a neural network capable 
of assessing WDS anomalies associated with particular sets of measurements. 
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1 Introduction 

Water companies use telemetry systems for control and operation purposes. By consid- 
ering the data provided by telemetry, the engineer on duty makes operation decisions 
trying to optimize the system utilization. Nevertheless, the system complexity does 
not permit but to take a few real-time measures, which only incompletely represent the 
network state. They give indication of only certain aspects of the system, leaving out 
other more specific or "less relevant" ones. Thus, suitable techniques that allow for 
more accurate network health estimation are necessary so that anomalies can be de- 
tected more rapidly, and light anomalies, which develop progressively and insidiously, 
can be identified. This will enable to control their consequences in earlier stages, thus 
avoiding, among other things, losses of water, which can be of great importance. 

The state of a WDS is obtained by interrelating different measures within a math- 
ematical model of the network, [Mar95|. Different tools to analyze water networks 
have been developed in the last years, SARA IIGMF98II . and EPANET MRos97l . among 
others. But state estimation cannot be accurately performed if there are missing or 
uncertain data. Thus, system operators need error limits for the state variables. Yet, 
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data are abundant since they are permanently received. Therefore, operators cannot 
evaluate errors easily or in real time. It is expected that suitable techniques borrowed 
from Artificial Intelligence (AI) could encapsulate the necessary knowledge to assess 
the network state. 

In this paper, we present an approach for the diagnosis and decision making pro- 
cess which is necessary on a neural network for clustering and pattern classification. 
First, the mathematical model, a state estimation procedure and a mechanism for treat- 
ing uncertainties, already presented in [Izq04] and |Izq05|, are briefly presented. The 
state estimator, together with the error limits will be used as a surrogate of the real 
WDS to generate data to train and check the neural network (NN). Then, the inherent 
procedures to neural techniques will be described. Specifically, the NN architecture, 
the classification and clustering mechanisms of both, crisp and fuzzy, patterns and the 
training technique will be presented. 



2 Mathematical Model and State Estimation 

Analyzing pressurized water systems is a complex task, especially for big systems. 
But even for moderately sized cities, it involves solving a big number of non-linear 
simultaneous equations. The complete set of equations may be written by using block- 
matrix notation, 



An(q) Axi \ ( q \ _ ( -A w Hf 
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(1) 



where A12 is the so-called connectivity matrix describing the way demand nodes are 
connected through the lines. Its size is L x N p , N p being the number of demand nodes 
and L the number of lines; q is the vector of the flow rates through the lines, H the 
vector of unknown heads at demand nodes; A10 is an L x Nf matrix, Nf being the 
number of fixed-head nodes with known head Hf, and Q is the A^-dimensional vector 
of demands. Finally, Au(q) is an L x L diagonal matrix. System ([TJ is a non-linear 
problem whose solution is the state vector x = (q, H)* of the system. 

The non-linear relations describing the system balances are complemented by the 
specific telemetry measurements. These measurements are integrated into the model 
by expanding system (fl]i to a new system, typically overdetermined: 

Au(q) A 12 

A{ 2 I ( «. ) = I Q I . (2) 
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The components A31 and A32 in system <(2J were introduced to account for additional 
telemetry measurements Mt with uncertainties in the demand predictions. System (f2j 
is usually solved using least-square methods for a state estimation by an over-relaxation 
iterative process applied to a linearized version of (fJJ: 

A' n (qW) A\2 \ / » \ / -A w H f - A n (qW)qW - A 12 HW 
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where A' n is the Jacobian matrix corresponding to An. 

3 Error Limit Analysis 

Error limit analysis is a process to determine uncertainty bounds for the state estimation 
originated by the lack of precision of measurements and data. To put it in a nutshell, 
the question is what is the reliability of the estimated state x* , if measurement vectors 
y are not crisp but may vary in some region, [y — Sy, y + Sy]l 

Different techniques may be used to estimate this unknown but bounded error, 
|Mil96|, |Nor86|, |Kur97|. We use a variant of the so-called sensitivity matrix analysis, 
MBar03l . which uses the state estimator presented above. 

In |Izq05], it is proved that a component by component bound, e*, for 5x* can be 
obtained by means of 

e* = \(A?WAty 1 A?W\\6y\, (4) 

where W is a diagonal matrix that weights the equations according to the nature of 
the right-hand sides, and the vertical bars indicate absolute values of all matrix and 
vector entries. Because of linearity, the bounds calculated by (|4]i are symmetrical and 
the error limit may be expressed as a multidimensional interval (see cell definition in 
next section) [x*] in the state space 

[x*]= [x\ n{ ,x* sup ] = [x*-e*,x*+e*]. (5) 

4 The Neural Network 

A neural network for clustering and classification is a mechanism for pattern recog- 
nition. Here, we use multidimensional cells, [Sim92|, |Lik94|. Voronoi diagrams are 
used inRef. HBle97l . 

A cell C is a region of the pattern space of n-dimensional vectors obtained as the 
intersection of n pairs of half-spaces of the form mj < Xi < Mi, for i — 1,2, ... ,n, 
where rrii and M$ are real numbers. Vectors m = (nii, i — 1, . . . , n) and M = 
(Mi, i = 1, . . . , n) are called min and max points of C and completely determine C. 
Membership of patterns to a cell is defined from fuzzy grounds. For fuzzy patterns, 
P = [P inf ,P su P], like the ones obtained in (0, membership values are given by the 
membership function 

c(P) = . max^ { max (F/ up - Mj) , W (m t - P/ nf ) } }, (6) 

where each <fii(x) controls the cell fuzziness. 

Values taken by membership function © are used during the operation phase to 
decide the membership degree to the class associated with a cell exhibited by certain 
pattern presented to it and, as a consequence, to recognize the potential anomalous state 
of the water distribution system corresponding to the associated label of each class. 
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Patterns presented to the network during the training phase are ordered pairs (P, I), 
where I is a label associated to pattern P describing the type of anomaly it represents. 

The NN implementing the classification process is a three-layer network that grows 
adapting itself to the problem characteristics. The input layer has 2n neurons, two 
for any of the dimensions of the patterns P = [P lnf , P sup ] . When a new pattern 
is presented to the network through the input layer, the components of vectors P mf 
and P su p are compared, respectively, with those of the minimum point, m, and the 
maximum point, M, of the J existing cells. Specifically, numbers in the inner brackets 
of (|6]l are calculated. 

This way, each neuron on the hidden layer has two n-dimensional vectors ip mf and 
tp sup as its input, formed by numbers between and 1, ready to be processed, first 
component by component with the max operator, and then with the max operator, but 
now through all the components. Specifically, 



is calculated for each cell. This process gives the membership degree of P to every one 
of the cells. Thus, membership functions may be considered as the transfer or activation 
functions for all the J existing hidden neurons. And the values of the minimum and 
maximum points of those existing cells, which will be adjusted during the training 
phase, must be regarded precisely as the synaptic weights between the input and the 
hidden layer. 

The values produced by the membership functions of the existing cells constitute 
the outputs of the hidden layer. These values must be operated with the weights be- 
tween the hidden and the output layers. This process will produce a class, a diagnosis of 
the hydraulic system represented by pattern P. This procedure facilitates the decisions 
to be made by the system managers. 

5 Conclusions 

The described neural procedure does not fit into any standard paradigm, since it is made 
of several sub-nets that evolve by accumulating experience as new loads (peak, valley, 
seasonal-dependent, etc.) are observed, which mimics human knowledge acquisition. 

From the reduced number of tests performed we conclude that the classification 
ability of the NN is excellent. Since the response given by the NN is graded, as a 
consequence of its fuzziness, the information it provides is not only qualitative (point- 
ing out an anomaly) but also quantitative (weighting the distributed importance of the 
problem). 

The tool presented here, once completed, calibrated and implemented, will provide 
WDS managers with a decision support mechanism allowing early identification of 
anomalies and, as a consequence, better Integrated Water Management. 
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